Search CORE

24 research outputs found

Real Time Stereo Image Registration for Planar Structure and 3D Sensor Pose Estimation

Author: Angel D. Sappa
Fadi Dornaika
Publication venue: 'IntechOpen'
Publication date: 01/11/2008
Field of study

IntechOpen

Simulated Annealing: A Novel Application of Image Processing in the Wood Area

Author: Aguilera Cristhian A.
Ramos Mario A.
Sappa Angel D.
Publication venue: 'IntechOpen'
Publication date: 29/08/2012
Field of study

IntechOpen

Tiny and Efficient Model for the Edge Detection Generalization

Author: Li Yachuan
Rouhani Mohammad
Sappa Angel D.
Soria Xavier
Publication venue
Publication date: 12/08/2023
Field of study

Most high-level computer vision tasks rely on low-level image operations as their initial processes. Operations such as edge detection, image enhancement, and super-resolution, provide the foundations for higher level image analysis. In this work we address the edge detection considering three main objectives: simplicity, efficiency, and generalization since current state-of-the-art (SOTA) edge detection models are increased in complexity for better accuracy. To achieve this, we present Tiny and Efficient Edge Detector (TEED), a light convolutional neural network with only

58K

parameters, less than

0.2

% of the state-of-the-art models. Training on the BIPED dataset takes

less than 30 minutes

, with each epoch requiring

less than 5 minutes

. Our proposed model is easy to train and it quickly converges within very first few epochs, while the predicted edge-maps are crisp and of high quality. Additionally, we propose a new dataset to test the generalization of edge detection, which comprises samples from popular images used in edge detection and image segmentation. The source code is available in https://github.com/xavysp/TEED.Comment: To Appear in ICCV 202

arXiv.org e-Print Archive

A Novel Domain Transfer-Based Approach for Unsupervised Thermal Image Super-Resolution

Author: Hammoud Riad
Rivadeneira Rafael E.
Sappa Angel D.
Vintimilla Boris X.
Publication venue: 'MDPI AG'
Publication date: 01/01/2022
Field of study

This paper presents a transfer domain strategy to tackle the limitations of low-resolution thermal sensors and generate higher-resolution images of reasonable quality. The proposed technique employs a CycleGAN architecture and uses a ResNet as an encoder in the generator along with an attention module and a novel loss function. The network is trained on a multi-resolution thermal image dataset acquired with three different thermal sensors. Results report better performance benchmarking results on the 2nd CVPR-PBVS-2021 thermal image super-resolution challenge than state-of-the-art methods. The code of this work is available online

Multidisciplinary Digital Publishing Institute

DSpace@MIT

Directory of Open Access Journals

PubMed Central

Diposit Digital de Documents de la UAB

Editorial: special issue on autonomous driving and driver assistance systems

Author: de la Escalera Arturo
Oliveira Miguel
Santos Vítor
Sappa Angel D.
Publication venue: 'Elsevier BV'
Publication date: 01/11/2019
Field of study

No abstract availablepublishe

Repositório Institucional da Universidade de Aveiro

Non-Rigid Registration meets Surface Reconstruction

Author: Boyer Edmond
Rouhani Mohammad
Sappa Angel D.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 08/12/2014
Field of study

International audienceNon rigid registration is an important task in computer vision with many applications in shape and motion modeling. A fundamental step of the registration is the data association between the source and the target sets. Such association proves difficult in practice, due to the discrete nature of the information and its corruption by various types of noise, e.g. outliers and missing data. In this paper we investigate the benefit of the implicit representations for the non-rigid registration of 3D point clouds. First, the target points are described with small quadratic patches that are blended through partition of unity weighting. Then, the discrete association between the source and the target can be replaced by a continuous distance field induced by the interface. By combining this distance field with a proper deformation term, the registration energy can be expressed in a linear least square form that is easy and fast to solve. This significantly eases the registration by avoiding direct association between points. Moreover, a hierarchical approach can be easily implemented by employing coarse-to-fine representations. Experimental results are provided for point clouds from multi-view data sets. The qualitative and quantitative comparisons show the outperformance and robustness of our framework

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Implicit B-Spline Surface Reconstruction

Author: Angel D. Sappa
Edmond Boyer
Mohammad Rouhani
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

An Iterative Multiresolution Scheme for SFM with Missing Data

Author: A. Buchanan
Angel D. Sappa
Antonio López
B. Triggs
C. Tomasi
C.J. Poelman
Carme Julià
D.W. Jacobs
Felipe Lumbreras
H. Aanaes
J.P. Costeira
Joan Serrat
M. Fischler
M. Han
N. Guilbert
P. Chen
T. Morita
T. Okatani
Y. Ma
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Multimodal interaction in image and video applications

Author: Sappa Angel D
Vitrià Jordi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Traditional Pattern Recognition (PR) and Computer Vision (CV) technologies have mainly focused on full automation, even though full automation often proves elusive or unnatural in many applications, where the technology is expected to assist rather than replace the human agents. However, not all the problems can be automatically solved being the human interaction the only way to tackle those applications. Recently, multimodal human interaction has become an important field of increasing interest in the research community. Advanced man-machine interfaces with high cognitive capabilities are a hot research topic that aims at solving challenging problems in image and video applications. Actually, the idea of computer interactive systems was already proposed on the early stages of computer science. Nowadays, the ubiquity of image sensors together with the ever-increasing computing performance has open new and challenging opportunities for research in multimodal human interaction. This book aims to show how existing PR and CV technologies can naturally evolve using this new paradigm. The chapters of this book show different successful case studies of multimodal interactive technologies for both image and video applications. They cover a wide spectrum of applications, ranging from interactive handwriting transcriptions to human-robot interactions in real environments

CERN Document Server